CDS

Accession Number TCMCG075C04789
gbkey CDS
Protein Id XP_007041252.2
Location join(1076398..1076765,1076875..1076964,1077043..1077104,1077250..1077356,1077476..1077565,1077683..1077805,1078358..1078435,1079133..1079218,1079304..1079352,1079672..1079728,1079797..1079927,1080494..1080577,1080943..1081006,1081207..1081644)
Gene LOC18607161
GeneID 18607161
Organism Theobroma cacao

Protein

Length 608aa
Molecule type protein
Topology linear
Data_file_division PLN
dblink BioProject:PRJNA341501
db_source XM_007041190.2
Definition PREDICTED: embryogenesis-associated protein EMB8 [Theobroma cacao]

EGGNOG-MAPPER Annotation

COG_category S
Description embryogenesis-associated protein
KEGG_TC -
KEGG_Module -
KEGG_Reaction -
KEGG_rclass -
BRITE ko00000        [VIEW IN KEGG]
KEGG_ko ko:K07019        [VIEW IN KEGG]
ko:K13696        [VIEW IN KEGG]
EC -
KEGG_Pathway -
GOs -

Sequence

CDS:  
ATGCCGGGAACCCCAACGTTCACATATCCTTCCATCCTTTCTATTTCTACAAAGGAAACTCCCTCTCAGCGTTCCGATTTGTTCCCAGAACCTCCGTTACTTCCCCCAATGGGCTGCGAAACGTTGGCGGCAAACGCCGCCGTCTCTCCCTACGATCTTCTCTTCCAAGCGCTCGCTCTCATCCCCGTCTCCCATTATTTCATGGCCGCTTTCTTACTATTTCTCATTTTTCTATACAATTTTCTCGAGATTCATTTCCTTCGCGATTTGCTCACTCTTTTCAGAGGCGACCCTGTTACCTTAACTTACAACTCTTGCTCCGACCTCTGCCAATCAGTCGTTGCCAAGTGTAAGATTCTTCACGGAAGGTACTCGGTGACTCCATGGCTTTCGAGTCCTCATCTTCAGACAGCTTTCTTGAGTATTTTTGGACGGGCTCCTCCGGTTACTTATAGACGGCATTTATTCCGTGCTTTGGATGGTGGAACAATTGCTTTGGATTGGCTAACTTATTCTGATGTTGTGGAAGGTACTTCCCGTGCAATTGATAGTTCCGCTGCTCTCAAAGGTGATAAAACTCCGATTATGATTGTGATTCCTGGCCTAACCAGTGATTCTGCTTCTGCTTATGTGAAGCATCTTGCCTTCAACATGGCAAGACAGGGTTGGAGTGTTCTTGTAAGCAATCACCGTGGGCTGGGAGGTGTATCACTTACGTCTGATTGCTTTTATAATGCTGGATGGACCGAGGATGTACGGAAAATCATTGACCACATACGTTGTGAATATCCAGAAGCTCCTCTATATGCTGTTGGAACTAGCATTGGTGCAAATATTCTGGTAAAATATCTTGGAGAGGATGGGGCTAATACTCCTCTTGTTGGGGCTGCAGCTATATGCTCTCCTTGGGACCTCTTGATATGTGATAGGTTCATCAACCGTAGACCTGTGCAGAAAATATATGACAGAGTGCTAACAGTTGGCCTGCAAGTTTATGCACAATTGCATCAGTCTATCTTGTCTCGTCTTGCAGATTGGGAGAGCATTAAAAAGTCAAATTCGGTTCGGGACTTTGACAACCATGCTACTCGAGTTCTTGGAAAATTTGAGACTGTGGATACATATTATAGGCGTTCAAGCAGTACTAATTATGTAGAAAACGTGTCAGTGCCTCTCCTCTGTATCAGTGCCTTGGATGATCCAGTGTGCACCAGTGAAGCTATTCCATGGGATGAATGTAGGGCCAATGAAAATATAATCCTAGCTACTGCGGCACATGGTGGACATCTGGCTTTCTATGAAGGGATAACGGCATCTAGCTTATGGTGGGTAAGAGCTGTTGATGAGTTCTTTGGTGTTCTACGCACTAGCCCATTTAGAAGGCAGAAGATCCAAGGTTCTACCTTGCCCAAGCCACTGCAATCTTCAATAGATCAGGGGCCTTATTTGAATGTTATGGGAGATGGAATGGTGGCAGCAGCGGGCAGTGAACCAAGAGACATTGTACCAGAAGACATGTCGAATGAGCATATGATTCATAGTAAGAAAGAAGAGGACACAATTTCAGATAAAGGAACAGGTCCTGACTTGACAGACAAAATATATTCTAACAAGCACATCATGAGGCAAGCAGAACAAAATGTCAAGGATTTGATTGTCCCTGTCCAAAGACGCGTTGATCAGCTCTCTCGCCGGAGTAGGCGATCAATCTGGTTGCTGGCATACATTGCCATTATAACAACTTGGCCGTTTGTCGGTTCTGTTCTTATCTCAGTTCTCAAGAGAAGGTTCAAAACTTTTGTACCGGCTACATTATTTAAAAAATAG
Protein:  
MPGTPTFTYPSILSISTKETPSQRSDLFPEPPLLPPMGCETLAANAAVSPYDLLFQALALIPVSHYFMAAFLLFLIFLYNFLEIHFLRDLLTLFRGDPVTLTYNSCSDLCQSVVAKCKILHGRYSVTPWLSSPHLQTAFLSIFGRAPPVTYRRHLFRALDGGTIALDWLTYSDVVEGTSRAIDSSAALKGDKTPIMIVIPGLTSDSASAYVKHLAFNMARQGWSVLVSNHRGLGGVSLTSDCFYNAGWTEDVRKIIDHIRCEYPEAPLYAVGTSIGANILVKYLGEDGANTPLVGAAAICSPWDLLICDRFINRRPVQKIYDRVLTVGLQVYAQLHQSILSRLADWESIKKSNSVRDFDNHATRVLGKFETVDTYYRRSSSTNYVENVSVPLLCISALDDPVCTSEAIPWDECRANENIILATAAHGGHLAFYEGITASSLWWVRAVDEFFGVLRTSPFRRQKIQGSTLPKPLQSSIDQGPYLNVMGDGMVAAAGSEPRDIVPEDMSNEHMIHSKKEEDTISDKGTGPDLTDKIYSNKHIMRQAEQNVKDLIVPVQRRVDQLSRRSRRSIWLLAYIAIITTWPFVGSVLISVLKRRFKTFVPATLFKK